A New Communication and Computation Overlapping Model with Loop Sub-Partitioning and Dynamic Scheduling
Abstract
Latency hiding techniques can significantly improve the performance of parallel programs on distributed-memory systems. This paper presents a communication and computation overlapping model that hides communication latency in data-parallel programs. The model uses a loop sub-partitioning scheme in which a given loop partition is divided again into finer partitions called loop sub-partitions. The computation of one loop sub-partition can be overlapped with the communication incurred by the remaining loop sub-partitions. This scheme hides latency within the scope of a single loop, whereas previous latency hiding techniques require several independent loops or computations. The communication incurred by each loop sub-partition behaves dynamically, so its completion time is difficult to predict on MIMD machines. In our model, the loop sub-partitions are therefore scheduled dynamically, according to the completion of their communication. This dynamic scheduling technique is based on dependence graphs that represent the relationship between message receptions and loop sub-partitions. Preliminary results on an IBM SP2 with LAPI show that our model significantly improves performance.
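The scheduling idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation and does not use LAPI: Python threads with artificial delays stand in for message receptions, and the names `split`, `run_overlapped`, `deps`, and `messages` are hypothetical. The dependence graph is assumed to be a simple mapping from each loop sub-partition to the set of messages it waits for. Sub-partitions with no pending receptions run first, overlapping with communication that is still in flight; the others run as soon as their last message arrives.

```python
import queue
import threading
import time


def split(lo, hi, n_sub):
    """Split the iteration range [lo, hi) into n_sub contiguous sub-partitions."""
    bounds = [lo + (hi - lo) * k // n_sub for k in range(n_sub + 1)]
    return list(zip(bounds[:-1], bounds[1:]))


def run_overlapped(data, deps, messages):
    """Run one loop partition with dynamically scheduled sub-partitions.

    data:     local loop partition (list of numbers).
    deps:     deps[k] is the set of message ids sub-partition k waits for
              (an assumed, simplified dependence graph).
    messages: message id -> (delay_seconds, payload); simulated receptions.
    Returns (out, order), where order is the execution order of sub-partitions.
    """
    subs = split(0, len(data), len(deps))
    arrived = queue.Queue()

    def receive(mid, delay, payload):
        time.sleep(delay)              # simulated, unpredictable latency
        arrived.put((mid, payload))    # signal completion of the reception

    threads = [threading.Thread(target=receive, args=(m, d, p))
               for m, (d, p) in messages.items()]
    for t in threads:
        t.start()

    received = {}
    pending = {k: set(d) for k, d in enumerate(deps)}
    out = [None] * len(data)
    order = []

    def execute(k):
        lo, hi = subs[k]
        halo = sum(received[m] for m in deps[k])
        for i in range(lo, hi):
            out[i] = 2 * data[i] + halo   # placeholder loop body
        order.append(k)

    # Sub-partitions without pending receptions overlap with communication.
    for k in [k for k, d in pending.items() if not d]:
        execute(k)
        del pending[k]

    # Remaining sub-partitions are scheduled as their messages complete.
    while pending:
        mid, payload = arrived.get()
        received[mid] = payload
        for k in [k for k, d in pending.items() if mid in d]:
            pending[k].discard(mid)
            if not pending[k]:
                execute(k)
                del pending[k]

    for t in threads:
        t.join()
    return out, order


if __name__ == "__main__":
    deps = [set(), {"a"}, {"b"}, {"a", "b"}]
    messages = {"a": (0.05, 10), "b": (0.01, 100)}
    print(run_overlapped(list(range(8)), deps, messages))
```

In this sketch the result does not depend on the order in which messages arrive; only the execution order of the sub-partitions changes, which is the property a dynamic scheduler of this kind relies on.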
Similar resources
A Threshold Accepting Algorithm for Partitioning Machines in a Tandem Automated Guided Vehicle
Abstract: A tandem automated guided vehicle (AGV) system deals with grouping workstations into some non-overlapping zones, and assigning exactly one AGV to each zone. This paper presents a new non-linear integer mathematical model to group n machines into N loops that minimizes both inter and intra-loop flows simultaneously. Due to computational difficulties of exact methods in solving our pr...
A New Approach in Job Shop Scheduling: Overlapping Operation
In this paper, a new approach to overlapping operations in job shop scheduling is presented. In many job shops, a customer demand can be met in more than one way for each job, where demand determines the quantity of each finished job ordered by a customer. In each job, embedded operations can be performed due to overlapping considerations in which each operation may be overlapped with the other...
Dynamic Task Partitioning Model in Parallel Computing
Parallel computing systems compose task partitioning strategies in a true multiprocessing manner. Such systems share the algorithm and processing unit as computing resources, which leads to high inter-process communication capabilities. The main part of the proposed algorithm is the resource management unit, which performs task partitioning and co-scheduling. In this paper, we present a technique ...
New scheduling rules for a dynamic flexible flow line problem with sequence-dependent setup times
In the literature, the application of multi-objective dynamic scheduling problem and simple priority rules are widely studied. Although these rules are not efficient enough due to simplicity and lack of general insight, composite dispatching rules have a very suitable performance because they result from experiments. In this paper, a dynamic flexible flow line problem with sequence-dependent se...
Programming Research Group A SCHEME FOR THE BSP SCHEDULING OF GENERIC LOOP NESTS
This report presents a scheme for the bulk-synchronous parallel (BSP) scheduling of generic, untightly nested loops. Being targeted at the BSP model of computation, the novel parallelisation scheme yields parallel code which is scalable, portable, and whose cost can be accurately analysed. The scheme comprises three stages: data dependence analysis and potential parallelism identification, data ...
Publication date: 2007